Comparative Study of Differentially Private Synthetic Data Algorithms from the NIST PSCR Differential Privacy Synthetic Data Challenge

نویسندگان

چکیده

Differentially private synthetic data generation offers a recent solution to release analytically useful while preserving the privacy of individuals in data. In order utilize these algorithms for public policy decisions, policymakers need an accurate understanding algorithms' comparative performance. Correspondingly, practitioners also require standard metrics evaluating analytic qualities this paper, we present in-depth evaluation several differentially using actual sets created by contestants National Institute Standards and Technology Public Safety Communications Research (NIST PSCR) Division's ``"Differential Privacy Synthetic Data Challenge." We offer analyses based on both accuracy they create their usability potential providers. frame methods used NIST PSCR challenge within broader literature. implement additional utility metrics, including two our own, compare mechanism three categories. Our assessment synthesis quality shows relative usefulness, general strengths weaknesses, preferred choices metrics. Finally describe implications seeking future products.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

DPSynthesizer: Differentially Private Data Synthesizer for Privacy Preserving Data Sharing

Differential privacy has recently emerged in private statistical data release as one of the strongest privacy guarantees. Releasing synthetic data that mimic original data with Differential privacy provides a promising way for privacy preserving data sharing and analytics while providing a rigorous privacy guarantee. However, to this date there is no open-source tools that allow users to genera...

متن کامل

Di↵erentially Private Verification of Predictions from Synthetic Data

Di↵erentially Private Verification of Predictions from Synthetic Data by Haoyang Yu Program in Statistical and Economic Modeling Duke University

متن کامل

A Comparative Study of Some Clustering Algorithms on Shape Data

Recently, some statistical studies have been done using the shape data. One of these studies is clustering shape data, which is the main topic of this paper. We are going to study some clustering algorithms on shape data and then introduce the best algorithm based on accuracy, speed, and scalability criteria. In addition, we propose a method for representing the shape data that facilitates and ...

متن کامل

Gradually Releasing Private Data under Differential Privacy

Aggregating individuals’ data and computing statistics over a population are key ingredients to enable the Internet of Things [1]. Constructing traffic maps from individuals’ GPS traces [2] and performing demand response in smart grids [3], [4] are two examples that involve such data aggregation. Using these statistics, individuals can perform their activities more efficiently; they may choose ...

متن کامل

a study on insurer solvency by panel data model: the case of iranian insurance market

the aim of this thesis is an approach for assessing insurer’s solvency for iranian insurance companies. we use of economic data with both time series and cross-sectional variation, thus by using the panel data model will survey the insurer solvency.

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: The journal of privacy and confidentiality

سال: 2021

ISSN: ['2575-8527']

DOI: https://doi.org/10.29012/jpc.748